Semantic Annotation for Indexing Archaeological Context: A Prototype Development and Evaluation
نویسندگان
چکیده
The paper discusses the process of developing Semantic Annotations, a form of metadata for assigning conceptual entities to textual instances, in this case archaeological grey literature. The use of Information Extraction (IE), a Natural Language Processing (NLP) technique is central to the annotation process. The paper explores the use of Ontology Oriented Information Extraction (OOIE) methods for the definition of rich semantic-aware indices of archaeology documents. The annotation process follows a rule-based information extraction approach using GATE. In particular the report discusses a prototype development that adopts the core ontology, CIDOC CRM, together with an English Heritage archaeological extension, to inform and direct the information extraction effort. The prototype evaluation, supports the assumptions made, about the capability of the method to construct rich indices of grey literature documents empowered by Semantic Annotations.
منابع مشابه
A pilot investigation of information extraction in the semantic annotation of archaeological reports
The paper discusses a prototype investigation of semantic annotation, a form of metadata assigning conceptual entities to textual instances, in this case archaeological grey literature. The use of Information Extraction (IE), a Natural Language Processing (NLP) technique, is central to the annotation process while the use of Knowledge Organization System (KOS) is explored for the association of...
متن کاملConcept-based semantic annotation, indexing and retrieval of office-like document units
We present an ontology-driven approach to semantic annotation, indexing and retrieval of document units. This approach is based on a novel semantic document model (SDM) that we developed to make office-like document units be uniquely identified, semantically annotated with concepts from annotation ontologies and linkable across document boundaries. In the semantic annotation model that we propo...
متن کاملTraining Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object
Training management in a company may benefit of a better integration with competence management outcomes. This paper is about an initial exploration of this proposal. It proposes a specific approach to support the indexing and retrieval of training courses with regard to the professions’ target competences. This approach is grounded on Learning Object metadata, and semantic web (SW) technologie...
متن کاملSemantic Annotation, Indexing, and Retrieval
The Semantic Web realization depends on the availability of a critical mass of metadata for the web content, associated with the respective formal knowledge about the world. We claim that the Semantic Web, at its current stage of development, is in a state of a critically need of metadata generation and usage schemata that are specific, well-defined and easy to understand. This paper introduces...
متن کامل